Deep learning possibilities for GERDA
Philipp Holl, Max-Planck-Institut für Physik
21.07.2017
Neutrinoless double beta decay (0𝜈𝛽𝛽)
- Only possible if the neutrino has a Majorana component
- Could help explain the matter dominance in the universe
GERDA: Search for 0𝜈𝛽𝛽 in Ge
- 40 kg of enriched germanium-76
- Underground laboratory @ Gran Sasso, Italy
Rejecting background events
Expected signal: < 1 event/yr
Event origins and their discrimination:
- Muons → muon detectors
- Radioactive decays → liquid argon scintillation and pulse shape discrimination (PSD)
Issues of current analysis
- Noise from long cables impacts PSD performance
- PSD not stable over time
How can we improve performance and get rid of existing issues?
Why deep learning?
Deep learning is about recognizing patterns:
- Speech recognition
- Text recognition
- Image classification
- Particle reconstruction
Image credits: https://commons.wikimedia.org/wiki/File:Sound_wave.jpg, https://www.flickr.com/photos/downloadsourcefr/16361602656, https://commons.wikimedia.org/wiki/File:CMS_Higgs-event.jpg
Why smart preprocessing is needed
Naïve approach
Training a neural network
- Calibration spectrum: Compton scattering continuum plus Bi, Tl, … decay lines (relatively pure signal/background-like)
- Peaks can be used as proxies for signal-like and background-like events (see the sketch below)
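A minimal sketch of how calibration peaks could provide labels, assuming numpy; the peak energies are the standard 228Th calibration lines, while the function name and the ±5 keV window are illustrative assumptions, not the selection actually used:

```python
import numpy as np

# Peak positions in the 228Th calibration spectrum (keV):
DEP_TL = 1592.5  # 208Tl double escape peak -> mostly single-site, signal-like
FEP_BI = 1620.7  # 212Bi full energy peak   -> mostly multi-site, background-like

def peak_mask(energies, peak, half_width=5.0):
    """Select events within an illustrative +/- 5 keV window around a peak."""
    return np.abs(np.asarray(energies) - peak) < half_width

# Usage, given reconstructed event energies in keV (one entry per event):
# signal_like     = peak_mask(energies, DEP_TL)
# background_like = peak_mask(energies, FEP_BI)
```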
Training a neural network
- Train a neural network (preprocessing + classifier) on the labelled pulses
- The network converges to a stable solution with good performance
- Problem: bad performance on new data; the network remembered the training data instead of learning general patterns
Fighting overfitting
How big can the network be?
- Rule of thumb: 10-100 times more data than learnable parameters
- ~10,000 events per class per detector from ~1 year of data taking; after splitting into training/validation/test sets, ~5,000 are left for training
- With 50 inputs, the network already has > 5000 parameters; the cross-validation curve was rising from the beginning
- A clever algorithm reduces each pulse from 1000 to 50 samples, but that is still too many inputs
- A tiny network with 7 inputs and hidden layers of 10 and 5 units has only ~140 parameters (see the sketch below)
Can such a tiny network do efficient classification? Yes.
How can we get an event down to 7 parameters? …
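The quoted parameter counts follow from the usual weights-plus-biases bookkeeping for fully connected layers. In this sketch, the hidden widths of the large network are an assumption (the slide only states 50 inputs and > 5000 parameters); the [7, 10, 5, 1] layout is the tiny classifier described later:

```python
# Learnable parameters of a dense network: (n_in + 1) * n_out per layer
# (weights plus one bias per output unit).
def n_params(layer_sizes):
    return sum((n_in + 1) * n_out
               for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:]))

print(n_params([50, 50, 50, 1]))  # 5151 -> already > 5000 parameters
print(n_params([7, 10, 5, 1]))    # 141  -> the ~140-parameter tiny network
```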
Unsupervised information reduction
Autoencoder
Autoencoder magic
- ~1,000,000 events total (unlabelled); ~10,000 events per peak (labelled)
- Autoencoder: a deep-learning-based algorithm that learns to compress data
- No labels required
How it works
1. Determine the number of compact parameters (it turns out 7 is pretty good)
2. Set up the network with random initialization
3. Train on all pulses
Encoder & decoder
- Encoder (pulse → 7 parameters) and decoder (7 parameters → pulse) are trained from data
- The result is an unbiased compact representation
- The compact representation is hard to visualize: what information is being stored?
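A minimal sketch of such an autoencoder, assuming Keras: the 7-unit bottleneck and the 50-sample input follow the slides, while the 32-unit intermediate layers, activations, and optimizer are illustrative assumptions, not the network actually used:

```python
from tensorflow.keras import layers, models

N_SAMPLES = 50  # pulse length after the 1000 -> 50 reduction
N_CODE = 7      # size of the compact representation

inputs = layers.Input(shape=(N_SAMPLES,))
# Encoder: compress the pulse into 7 compact parameters.
h = layers.Dense(32, activation="relu")(inputs)
code = layers.Dense(N_CODE, name="code")(h)
# Decoder: reconstruct the pulse from the 7 parameters.
h = layers.Dense(32, activation="relu")(code)
outputs = layers.Dense(N_SAMPLES)(h)

autoencoder = models.Model(inputs, outputs)  # trained end to end
encoder = models.Model(inputs, code)         # reused later for PSD

# No labels needed: the pulse itself is the training target.
autoencoder.compile(optimizer="adam", loss="mse")
# pulses: array of shape (n_events, N_SAMPLES) holding the unlabelled traces
# autoencoder.fit(pulses, pulses, epochs=30, validation_split=0.2)
```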
Information reduction to 7 parameters
ANG1, calibration data above 1 MeV from runs 53-64 (encoder → decoder)
- Reconstructions contain less information: only the most relevant data is stored
- Noise is discarded as it follows no pattern … except when it does (see next slide)
Correlated noise
ANG5, calibration data above 1 MeV from runs 53-64 (encoder → decoder)
- Some detectors show highly correlated noise
- Reconstructions capture the important features of the pulse very well
Accurate peak height
GD00A, calibration data above 1 MeV from runs 53-64 (encoder → decoder)
- Peak height for signal-like events is very accurate for BEGes
Combined analysis
Classification after autoencoder preprocessing
Combined PSD
Pipeline: pulse → preprocessing → encoder → 7 parameters → classifier
- Train a neural network classifier on the compact representation
- Few learnable parameters required, so less overfitting
Signal-background classification
GD00A, calibration data above 1 MeV from runs 53-64
- Tiny classifier (7 inputs, hidden layers of 10 and 5 units) with 141 parameters
- Trained on the Tl peak (signal) and the Bi peak (background), as sketched below
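A sketch of the combined pipeline under the same Keras assumption, reusing the `encoder` from the autoencoder sketch above; the hidden sizes 10 and 5 follow the slide, while the optimizer and loss are assumptions:

```python
from tensorflow.keras import layers, models

# Tiny classifier on the 7 compact parameters:
# (7+1)*10 + (10+1)*5 + (5+1)*1 = 141 learnable parameters.
clf = models.Sequential([
    layers.Dense(10, activation="relu", input_shape=(7,)),
    layers.Dense(5, activation="relu"),
    layers.Dense(1, activation="sigmoid"),  # output: P(signal-like)
])
clf.compile(optimizer="adam", loss="binary_crossentropy")

# codes = encoder.predict(pulses)   # encoder acts as fixed preprocessing
# clf.fit(codes, labels)            # labels from the Tl / Bi peak windows
```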
Performance on calibration data
- Performance equal to current methods
- Less dependent on noise
- No manual corrections required
Performance on physics data
Time stability
- Almost all detectors show no significant deviation over time
Summary & Outlook
New method for pulse shape analysis:
- Bias-free, nonlinear information reduction with an autoencoder
- Competitive with current methods
- No time-dependent corrections required
- Relatively independent of the noise level
Future possibilities:
- Training on simulation data
- A/E or energy reconstruction after noise-filtering by the autoencoder
Backup
Can we go lower than 7?
Single-parameter autoencoder: 1 parameter → decoder → pulse
- Decoder reconstructions shown for parameter values between 0.0 and 1.0 (e.g. 0.12, 0.51, 0.82)
- “A/E for coax detectors”
Peak efficiencies
ANG1 (coax) & GD00A (BEGe), test set of runs 53-64
- Peak fit model: Gaussian + first-order polynomial
- Background peaks have very similar performance
- The Tl SEP usually has the best rejection
Efficiencies on calibration data
Coax: ANG5, calibration data above 1 MeV from runs 53-64
BEGe: GD00A, calibration data above 1 MeV from runs 53-64
- Results competitive with current implementations
Efficiencies on calibration data
Coax: ANG1, calibration data above 1 MeV from runs 53-64
BEGe: GD91C, calibration data above 1 MeV from runs 53-64
- Especially good performance for detectors with a high noise level
Energy dependence
GD00A, calibration data above 1 MeV from runs 53-64 (plot annotations: DEP, Bi FEP, SEP, FEP, cut)
- Only trained on DEP and FEP
- Clear separation between SSE and MSE across the full energy range
Mean efficiencies
Survival fractions, PNN with the current method in parentheses:

         DEP    FEP         SEP         FEP         Comp.       2𝜈𝛽𝛽
BEGe     90%    18% (17%)   13% (12%)   16% (15%)   52% (44%)   86% (84%)
Coax     90%    43% (41%)   47% (45%)   45% (45%)   65% (63%)   79% (77%)

- Efficiencies evaluated on 40% of Phase IIb data
- Mean of all detectors that are used for the current analysis
- 2𝜈𝛽𝛽 evaluated from 1 to 1.3 MeV; Compton at 𝑄𝛽𝛽 ± 30 keV
- Performance similar to current methods
- Almost all differences can be explained by the different algorithm used to set the cut value (see the sketch below)
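Since the slide attributes most remaining differences to how the cut value is set, here is one common way to do it, as a sketch assuming numpy arrays of classifier scores: fix the threshold so that 90% of DEP (signal-like) events survive, then read off the survival of the other peaks. The function and variable names are hypothetical, not the algorithms actually compared:

```python
import numpy as np

def cut_for_dep_survival(dep_scores, survival=0.90):
    """Threshold such that the given fraction of DEP events lies above it."""
    return np.quantile(dep_scores, 1.0 - survival)

# cut = cut_for_dep_survival(scores_dep)   # fixes DEP survival at 90%
# fep_eff = np.mean(scores_fep > cut)      # survival of a background peak
```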