Spectrum Reconstruction of Atmospheric Neutrinos with Unfolding Techniques Juande Zornoza UW Madison.

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

CHAPTER 8 More About Estimation. 8.1 Bayesian Estimation In this chapter we introduce the concepts related to estimation and begin this by considering.
CS479/679 Pattern Recognition Dr. George Bebis
2 – In previous chapters: – We could design an optimal classifier if we knew the prior probabilities P(wi) and the class- conditional probabilities P(x|wi)
London Collaboration Meeting September 29, 2005 Search for a Diffuse Flux of Muon Neutrinos using AMANDA-II Data from Jessica Hodges University.
B.Macukow 1 Lecture 12 Neural Networks. B.Macukow 2 Neural Networks for Matrix Algebra Problems.
RooUnfold unfolding framework and algorithms
JET CHARGE AT LHC Scott Lieberman New Jersey Institute of Technology TUNL REU 2013 Duke High Energy Physics Group Working with: Professor Ayana Arce Dave.
Data mining and statistical learning - lecture 6
Date:2011/06/08 吳昕澧 BOA: The Bayesian Optimization Algorithm.
Maximum likelihood (ML) and likelihood ratio (LR) test
A Search for Point Sources of High Energy Neutrinos with AMANDA-B10 Scott Young, for the AMANDA collaboration UC-Irvine PhD Thesis:
Probability theory 2011 The multivariate normal distribution  Characterizing properties of the univariate normal distribution  Different definitions.
Maximum likelihood (ML) and likelihood ratio (LR) test
Statistical Image Modelling and Particle Physics Comments on talk by D.M. Titterington Glen Cowan RHUL Physics PHYSTAT05 Glen Cowan Royal Holloway, University.
Machine Learning CMPT 726 Simon Fraser University
Energy Reconstruction Algorithms for the ANTARES Neutrino Telescope J.D. Zornoza 1, A. Romeyer 2, R. Bruijn 3 on Behalf of the ANTARES Collaboration 1.
Linear and generalised linear models
Linear and generalised linear models
Linear and generalised linear models Purpose of linear models Least-squares solution for linear models Analysis of diagnostics Exponential family and generalised.
Probability theory 2008 Outline of lecture 5 The multivariate normal distribution  Characterizing properties of the univariate normal distribution  Different.
Maximum likelihood (ML)
Kirsten Münich Dortmund University Diffuse Limit with an unfolding method AMANDA Collaboration Meeting Berkeley, March 2005.
Chapter 15 Modeling of Data. Statistics of Data Mean (or average): Variance: Median: a value x j such that half of the data are bigger than it, and half.
Unfolding algorithms and tests using RooUnfold
23 rd to 24 th October 2006GIST 25, UK Met Office, Exeter In-flight Assessment of the end-to-end spectral responses of the GERB radiometer detectors Glyn.
Unfolding jet multiplicity and leading jet p T spectra in jet production in association with W and Z Bosons Christos Lazaridis University of Wisconsin-Madison.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
Course 13 Curves and Surfaces. Course 13 Curves and Surface Surface Representation Representation Interpolation Approximation Surface Segmentation.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
Progress in identification of damping: Energy-based method with incomplete and noisy data Marco Prandina University of Liverpool.
RooUnfold unfolding framework and algorithms Tim Adye Rutherford Appleton Laboratory ATLAS RAL Physics Meeting 20 th May 2008.
A statistical test for point source searches - Aart Heijboer - AWG - Cern june 2002 A statistical test for point source searches Aart Heijboer contents:
RooUnfold unfolding framework and algorithms Tim Adye Rutherford Appleton Laboratory Oxford ATLAS Group Meeting 13 th May 2008.
1 Iterative dynamically stabilized (IDS) method of data unfolding (*) (*arXiv: ) Bogdan MALAESCU CERN PHYSTAT 2011 Workshop on unfolding.
Measurement of the atmospheric lepton energy spectra with AMANDA-II presented by Jan Lünemann* for Kirsten Münich* for the IceCube collaboration * University.
Data collected during the year 2006 by the first 9 strings of IceCube can be used to measure the energy spectrum of the atmospheric muon neutrino flux.
Study of neutrino oscillations with ANTARES J. Brunner.
Study of neutrino oscillations with ANTARES J. Brunner.
Chapter 28 Cononical Correction Regression Analysis used for Temperature Retrieval.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Supervised Learning Resources: AG: Conditional Maximum Likelihood DP:
Bayes Theorem The most likely value of x derived from this posterior pdf therefore represents our inverse solution. Our knowledge contained in is explicitly.
Unfolding in ALICE Jan Fiete Grosse-Oetringhaus, CERN for the ALICE collaboration PHYSTAT 2011 CERN, January 2011.
JET CHARGE AT LHC Scott Lieberman New Jersey Institute of Technology TUNL REU 2013 Duke High Energy Physics Group Working with: Professor Ayana Arce Dave.
Introducing Error Co-variances in the ARM Variational Analysis Minghua Zhang (Stony Brook University/SUNY) and Shaocheng Xie (Lawrence Livermore National.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: MLLR For Two Gaussians Mean and Variance Adaptation MATLB Example Resources:
2005 Unbinned Point Source Analysis Update Jim Braun IceCube Fall 2006 Collaboration Meeting.
8th December 2004Tim Adye1 Proposal for a general-purpose unfolding framework in ROOT Tim Adye Rutherford Appleton Laboratory BaBar Statistics Working.
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
G. Cowan CERN Academic Training 2012 / Statistics for HEP / Lecture 41 Statistics for HEP Lecture 4: Unfolding Academic Training Lectures CERN, 2–5 April,
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
Chapter 61 Chapter 7 Review of Matrix Methods Including: Eigen Vectors, Eigen Values, Principle Components, Singular Value Decomposition.
Muon Energy reconstruction in IceCube and neutrino flux measurement Dmitry Chirkin, University of Wisconsin at Madison, U.S.A., MANTS meeting, fall 2009.
IC40 Spectrum Unfolding 7/1/2016Warren Huelsnitz1 SVD Method described in A. Höcker and V. Kartvelishvili, NIM A 372 (1996) 469NIM A 372 (1996) 469 Implemented.
Canadian Bioinformatics Workshops
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 1: INTRODUCTION.
Chapter 3: Maximum-Likelihood Parameter Estimation
Latent Variables, Mixture Models and EM
Unfolding Problem: A Machine Learning Approach
Unfolding atmospheric neutrino spectrum with IC9 data (second update)
Filtering and State Estimation: Basic Concepts
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Unfolding performance Data - Monte Carlo comparison
Statistical Methods for Particle Physics Lecture 4: unfolding
LECTURE 15: REESTIMATION, EM AND MIXTURES
Introduction to Unfolding
Juande Zornoza UW Madison
Unfolding with system identification
Random Neural Network Texture Model
Presentation transcript:

Spectrum Reconstruction of Atmospheric Neutrinos with Unfolding Techniques Juande Zornoza UW Madison

Introduction  We will review different approaches for the reconstruction of the energy spectrum of atmospheric neutrinos:  Blobel / Singular Value Decompostion  actually, both methods are basically the same, with only differences in issues not directly related with the unfolding but with the regularization and so on  Singular Value Decomposition

Spectrum reconstruction  In principle, energy spectra can be obtained using the reconstructed energy of each event.  However, this is not efficient in our case because of the combination of two factors:  Fast decrease (power-law) of the flux.  Large fluctuations in the energy deposition.  For this reason, an alternative way has to be used: unfolding techniques.

Spectrum unfolding Quantity to obtain: y, which follows pdf → f true (y) Measured quantity: b, which follows pdf→f meas (b) Both are related by the Fredholm integral equation of first kind: Matrix notation The response matrix takes into account three factors: -Limited acceptance -Finite resolution -Transformation (in our case, b will be xlow) The response matrix inversion gives useless solutions, due to the effect of statistical fluctuations, so two alternative methods have been tried: -Singular Value Decomposition -Iterative Method based on Bayes’ Theorem

Dealing with instabilities  Regularization:  Solution with minimum curvature  Solution with strictly positive curvature  Principle of maximum entropy  Iterative procedure, leading asymptotically to the unfolded distribution

Single Value Decomposition 1  The response matrix is decomposed as: 1 A. Hoecker, Nucl. Inst. Meth. in Phys. Res. A 372:469 (1996) U, V: orthogonal matrices S: non-negative diagonal matrix (s i, singular values)  This can be also seen as a minimization problem:  Or, introducing the covariance matrix to take into account errors:

SVD: normalization  Actually, it is convenient to normalize the unknowns  A ij contains the number of events, not the probability.  Advantages:  the vector w should be smooth, with small bin-to-bin variations  avoid weighting too much the cases of 100% probability when only one event is in the bin

SVD: Rescaling  Rotation of matrices allows to rewrite the system with a covariance matrix equal to I, more convenient to work with:

Regularization  Several methods have been proposed for the regularization. The most common is to add a curvature term add a curvature term  Other option: principle of maximum entropy

Regularization  We have transformed the problem in the optimization of the value of , which tunes how much regularization we include:   too large: physical information lost   too small: statistical fluctuations spoil the result  In order to optimize the value of  :  Evaluation using MC information  Components of vector  Maximum curvature of the L-curve L-curve

Solution to the system  Actually, the solution to the system with the curvature term can be expressed as a function of the solution without curvature: where (Tikhonov factors)

Tikhonov factors  The non-zero tau is equivalet to change d i by fun2 fun0 fun1 Components of d  = s k 2 k  And this allows to find a criteria to find a good tau

Bayesian Iterative Method 2 If there are several causes (E i ) which can produce an effect X j and we know the initial probability of the causes P(E i ), the conditional probability of the cause to be E i when X is observed is: The dependence on the initial probability P 0 (E i ) can be overcome by an iterative process. The expected number of events to be assigned to each of the causes is: 2 G. D'Agostini NIM A362(1995) prior guess: iterative approach smearing matrix: MC experimental data (simulated)

Reconsructed spectrum Iterative algorithm 1.Choose the initial distribution P 0 (E). For instance, a good guess could be the atmospheric flux (without either prompt neutrinos or signal). 2.Calculate and. 3.Compare to. 4.Replace by and by. 5.Go to step 2. P(X j |E i ) P o (E) P(E i |X j ) n(X j ) n(E i )P(E i ) Initial guess n o (E) Experimental data Smearing matrix (MC)

Differences between SVD and Blobel  Different curvature term  Selection of optimum tau  B-splines used in the standard Blobel implementation  …

B-splines  Spline: piecewise continuous and differentiable function that connects two neighbor points by a cubic polynomial: from H. Greene PhD.  B-spline: spline functions can be expressed by a finite superposition of base functions (B-spilines). (higher orders)(first order)

For IceCube  Several parameters can be investigated:  Number of channels  Number of hits  Reconstructed energy  Neural network output…  With IceCube, we will have much better statistics than with AMANDA  But first, reconstruction with 9 strings will be the priority

Remarks  First, a good agreement between data and MC is necessary  Different unfolding methods will be compared (several internal parameters to tune in each method)  Several regularization techniques are also available in the literature  Also an investigation on the best variable for unfolding has to be done  Maybe several variables can be used in a multi-D analysis