Speed-up of the ring recognition algorithm Semeon Lebedev GSI, Darmstadt, Germany and LIT JINR, Dubna, Russia Gennady Ososkov LIT JINR, Dubna, Russia.

Slides:



Advertisements
Similar presentations
DEVELOPMENT OF ONLINE EVENT SELECTION IN CBM DEVELOPMENT OF ONLINE EVENT SELECTION IN CBM I. Kisel (for CBM Collaboration) I. Kisel (for CBM Collaboration)
Advertisements

OPTIMIZING C CODE FOR THE ARM PROCESSOR Optimizing code takes time and reduces source code readability Usually done for functions that are critical for.
Intel Pentium 4 ENCM Jonathan Bienert Tyson Marchuk.
MP3 Optimization Exploiting Processor Architecture and Using Better Algorithms Mancia Anguita Universidad de Granada J. Manuel Martinez – Lechado Vitelcom.
L1 Event Reconstruction in the STS I. Kisel GSI / KIP CBM Collaboration Meeting Dubna, October 16, 2008.
Computing EVO meeting, January 15 th 2013 Status of the Tracking Code Gianluigi Boca, Pavia University.
1 Geometry layout studies of the RICH detector in the CBM experiment Elena Belolaptikova Dr. Claudia Hoehne Moscow State Institute of Radioengineering,
HLT - data compression vs event rejection. Assumptions Need for an online rudimentary event reconstruction for monitoring Detector readout rate (i.e.
E.Papandrea PM3 - Paris, 2 nd Mar 2004 DFCI COMPUTING PERFORMANCEPage 1 Enzo Papandrea COMPUTING PERFORMANCE.
ALICE HLT High Speed Tracking and Vertexing Real-Time 2010 Conference Lisboa, May 25, 2010 Sergey Gorbunov 1,2 1 Frankfurt Institute for Advanced Studies,
K-Ary Search on Modern Processors Fakultät Informatik, Institut Systemarchitektur, Professur Datenbanken Benjamin Schlegel, Rainer Gemulla, Wolfgang Lehner.
CA+KF Track Reconstruction in the STS I. Kisel GSI / KIP CBM Collaboration Meeting GSI, February 28, 2008.
KIP TRACKING IN MAGNETIC FIELD BASED ON THE CELLULAR AUTOMATON METHOD TRACKING IN MAGNETIC FIELD BASED ON THE CELLULAR AUTOMATON METHOD Ivan Kisel KIP,
Online Event Selection in the CBM Experiment I. Kisel GSI, Darmstadt, Germany RT'09, Beijing May 12, 2009.
Online Track Reconstruction in the CBM Experiment I. Kisel, I. Kulakov, I. Rostovtseva, M. Zyzak (for the CBM Collaboration) I. Kisel, I. Kulakov, I. Rostovtseva,
CA tracker for TPC online reconstruction CERN, April 10, 2008 S. Gorbunov 1 and I. Kisel 1,2 S. Gorbunov 1 and I. Kisel 1,2 ( for the ALICE Collaboration.
Many-Core Scalability of the Online Event Reconstruction in the CBM Experiment Ivan Kisel GSI, Germany (for the CBM Collaboration) CHEP-2010 Taipei, October.
Helmholtz International Center for CBM – Online Reconstruction and Event Selection Open Charm Event Selection – Driving Force for FEE and DAQ Open charm:
Elastic Neural Net for standalone RICH ring finding Sergey Gorbunov Ivan Kisel DESY, Zeuthen KIP, Uni-Heidelberg DESY, Zeuthen KIP, Uni-Heidelberg KIP.
YOU LI SUPERVISOR: DR. CHU XIAOWEN CO-SUPERVISOR: PROF. LIU JIMING THURSDAY, MARCH 11, 2010 Speeding up k-Means by GPUs 1.
History of Microprocessor MPIntroductionData BusAddress Bus
Hough Transform : A preliminary study Indranil Das HEP Devn., SINP.
© 2007 SET Associates Corporation SAR Processing Performance on Cell Processor and Xeon Mark Backues, SET Corporation Uttam Majumder, AFRL/RYAS.
Fast reconstruction of tracks in the inner tracker of the CBM experiment Ivan Kisel (for the CBM Collaboration) Kirchhoff Institute of Physics University.
Fast track reconstruction in the muon system and transition radiation detector of the CBM experiment at FAIR Andrey Lebedev 1,2 Claudia Höhne 3 Ivan Kisel.
TRD and Global tracking Andrey Lebedev GSI, Darmstadt and LIT JINR, Dubna Gennady Ososkov LIT JINR, Dubna X CBM collaboration meeting Dresden, 27 September.
G.Ososkov NEC Pattern recognition methods for data handling of the CBM experiment Gennady Ososkov LIT JINR, Dubna, Russia Andrey and Semeon.
Status of Reconstruction in CBM
A.Ayriyan 1, V.Ivanov 1, S.Lebedev 1,2, G.Ososkov 1 in collaboration with N.Chernov 3 1 st CBM Collaboration Meeting, JINR Duba,19-22 May JINR-LIT,
KIP Ivan Kisel JINR-GSI meeting Nov 2003 High-Rate Level-1 Trigger Design Proposal for the CBM Experiment Ivan Kisel for Kirchhoff Institute of.
Standalone FLES Package for Event Reconstruction and Selection in CBM DPG Mainz, 21 March 2012 I. Kisel 1,2, I. Kulakov 1, M. Zyzak 1 (for the CBM.
December 2, 2015Single-Instruction Multiple Data (SIMD)1 Performance Optimization, cont. How do we fix performance problems?
HONGIK UNIVERSITY School of Radio Science & Communication Engineering Visual Information Processing Lab Hong-Ik University School of Radio Science & Communication.
Introduction to MMX, XMM, SSE and SSE2 Technology
Track Finding based on a Cellular Automaton Ivan Kisel Kirchhoff-Institut für Physik, Uni-Heidelberg Tracking Week, GSI January 24-25, 2005 KIP.
Introdution to SSE or How to put your algorithms on steroids! Christian Kerl
GEM: A Framework for Developing Shared- Memory Parallel GEnomic Applications on Memory Constrained Architectures Mucahid Kutlu Gagan Agrawal Department.
Ring Recognition and Electron Identification in the RICH detector of the CBM Experiment at FAIR Semeon Lebedev GSI, Darmstadt, Germany and LIT JINR, Dubna,
First Level Event Selection Package of the CBM Experiment S. Gorbunov, I. Kisel, I. Kulakov, I. Rostovtseva, I. Vassiliev (for the CBM Collaboration (for.
Methods for fast reconstruction of events Ivan Kisel Kirchhoff-Institut für Physik, Uni-Heidelberg FutureDAQ Workshop, München March 25-26, 2004 KIP.
Fast Tracking of Strip and MAPS Detectors Joachim Gläß Computer Engineering, University of Mannheim Target application is trigger  1. do it fast  2.
Track reconstruction in TRD and MUCH Andrey Lebedev Andrey Lebedev GSI, Darmstadt and LIT JINR, Dubna Gennady Ososkov Gennady Ososkov LIT JINR, Dubna.
Global Tracking for CBM Andrey Lebedev 1,2 Ivan Kisel 1 Gennady Ososkov 2 1 GSI Helmholtzzentrum für Schwerionenforschung GmbH, Darmstadt, Germany 2 Laboratory.
Cellular Automaton Method for Track Finding (HERA-B, LHCb, CBM) Ivan Kisel Kirchhoff-Institut für Physik, Uni-Heidelberg Second FutureDAQ Workshop, GSI.
Muon detection in the CBM experiment at FAIR Andrey Lebedev 1,2 Claudia Höhne 1 Ivan Kisel 1 Anna Kiseleva 3 Gennady Ososkov 2 1 GSI Helmholtzzentrum für.
CA+KF Track Reconstruction in the STS S. Gorbunov and I. Kisel GSI/KIP/LIT CBM Collaboration Meeting Dresden, September 26, 2007.
Kalman Filter based Track Fit running on Cell S. Gorbunov 1,2, U. Kebschull 2, I. Kisel 2,3, V. Lindenstruth 2 and W.F.J. Müller 1 1 Gesellschaft für Schwerionenforschung.
Compact RICH detector Claudia Höhne, GSI Darmstadt.
1/13 Future computing for particle physics, June 2011, Edinburgh A GPU-based Kalman filter for ATLAS Level 2 Trigger Dmitry Emeliyanov Particle Physics.
Ring finding efficiencies and RICH event display Semeon Lebedev GSI, Darmstadt and LIT JINR, Dubna Gennady Ososkov LIT JINR, Dubna X CBM collaboration.
Fast parallel tracking algorithm for the muon detector of the CBM experiment at FAIR Andrey Lebedev 1,2 Claudia Höhne 1 Ivan Kisel 1 Gennady Ososkov 2.
AliRoot survey: Reconstruction P.Hristov 11/06/2013.
Parallel Implementation of the KFParticle Vertexing Package for the CBM and ALICE Experiments Ivan Kisel 1,2,3, Igor Kulakov 1,4, Maksym Zyzak 1,4 1 –
LIT participation LIT participation Ivanov V.V. Laboratory of Information Technologies Meeting on proposal of the setup preparation for external beams.
Track Reconstruction in MUCH and TRD Andrey Lebedev 1,2 Gennady Ososkov 2 1 Gesellschaft für Schwerionenforschung, Darmstadt, Germany 2 Laboratory of Information.
Gianluigi, 10 Sep 2012 Paris, September 10 th 2012 Status of the Pattern Recognition Code Gianluigi Boca GSI & Pavia University 1.
IPHC, Strasbourg / GSI, Darmstadt
Event reconstruction for the RICH prototype beamtest data 2014
Y. Fisyak1, I. Kisel2, I. Kulakov2, J. Lauret1, M. Zyzak2
Kalman filter tracking library
Fast Parallel Event Reconstruction
Status of the Offline Pattern Recognition Code GSI, December 10th 2012
ALICE HLT tracking running on GPU
TPC reconstruction in the HLT
Simulation results for the upgraded RICH detector in the HADES experiment. Semen Lebedev1,3, Jürgen Friese2, Claudia Höhne1, Tobias Kunz2, Jochen Markert4.
MMX Multi Media eXtensions
Hough Based Tracking For Low Pt In MDC
Individual Particle Reconstruction
STUDY AND IMPLEMENTATION
road – Hough transform- c2
Presentation transcript:

Speed-up of the ring recognition algorithm Semeon Lebedev GSI, Darmstadt, Germany and LIT JINR, Dubna, Russia Gennady Ososkov LIT JINR, Dubna, Russia

Speed up of the ring finder Dubna, Motivation Fast algorithm -> less computers requirements Possibility to use on-line reconstruction Many cores CPUs -> algorithms can be parallelized I. Kisel, March 2009, CBM Coll. Meeting

Speed up of the ring finder Dubna, Ring recognition algorithm Global search. Filter: algorithm compares all ring-candidates and chooses only good rings, rejecting clone and fake rings. Standalone ring finder. Local search of ring- candidates, based on local selection of hits and Hough Transform. Two steps: 99% 1% Time consumption

Speed up of the ring finder Dubna, Ring recognition algorithm, local search Preliminary selection of hits Histogram of ring centers HoughTransform Ellipse fitter Ring quality calculation Remove hits of found ring (only best matched hits) Ring array

Speed up of the ring finder Dubna, Time consumption Define local area and hitsHough TransformPeak finder 30%69% 1% Time consumption Hits search Arrays initialization Triple loop of ring parameters calculation peak finding in 2D and 1D array Optimize hits search and arrays sizes and dimensions, remove dynamic memory allocation Optimize calculations inside loops, decrease combinatory Where?

Speed up of the ring finder Dubna, Optimization of Hough Transform Divide hits into a several parts Make Hough Transform of each part independently First part of hits Second part of hits Hough Transform Sum up histogram

Speed up of the ring finder Dubna, Optimization of Hough Transform: SIMD and SSE2 SSE 128-bit registers can represent: sixteen 8-bit signed or unsigned chars, eight 16-bit signed or unsigned shorts, four 32-bit integers, or four 32-bit floating point variables. 128 bit register Four concurrent add operations Algorithm must work with single precision type (float)

Speed up of the ring finder Dubna, Ring finder and SIMD SIMD version of CalculateRingParameters(x[3], y[3], &xc, &yc, &r) was implemented. CalculateRingParameters(x[3], y[3], &xc, &yc, &r), where x, y, xc, yc, r are floats CalculateRingParameters(xv[3], yv[3], &xcv, &ycv, &rv), where xv, yv, xcv, ycv, rv are F32vec4

Speed up of the ring finder Dubna, Optimization and performance Time per 1 events, ms CommentsInitial versionDouble->float Refactoring + base class for HT Remove modf() SIMD Hits presearch Remove dynamic allocation SIMD Divide hits into several parts SIMD Calc. ring params inside loop Change division to multiplication RF parameters optimization Speed up factor: 7.7 Processor Intel Pentium Core GHz

Speed up of the ring finder Dubna, Electron ring finding efficiency Au-Au central collision at 25 AGev plus 5e+ and 5e- Compact RICH geometry

Speed up of the ring finder Dubna, Summary Ring finder was significantly optimized in terms of calculation speed without loosing an efficiency Next step: –HT parameters optimization –Parallelization on multi core CPU –Continue investigation of SIMD version